Àá½Ã¸¸ ±â´Ù·Á ÁÖ¼¼¿ä. ·ÎµùÁßÀÔ´Ï´Ù.
KMID : 1022420110030010087
Phonetics and Speech Sciences
2011 Volume.3 No. 1 p.87 ~ p.94
Two-step a priori SNR Estimation in the Log-mel Domain Considering Phase Information
Lee Yun-Kyung

Kwon Oh-Wook
Abstract
The decision directed (DD) approach is widely used to determine a priori SNR from noisy speech signals. In conventional speech enhancement systems with a DD approach, a priori SNR is estimated by using only the magnitude components and consequently follows a posteriori SNR with one frame delay. We propose a phase-dependent two-step a priori SNR estimator based on the minimum mean square error (MMSE) in the log-mel spectral domain so that we can consider both magnitude and phase information, and it can overcome the performance degradation caused by one frame delay. From the experimental results, the proposed estimator is shown to improve the output SNR of enhanced speech signals by 2.3 §¼ compared to the conventional DD approach-based system.
KEYWORD
phase modeling, speech enhancement, speech separation, MMSE, decision-directed, a priori SNR
FullTexts / Linksout information
Listed journal information
ÇмúÁøÈïÀç´Ü(KCI)